A Metaheuristic Technique for Cluster-Based Feature Selection of DNA Methylation Data for Cancer

نویسندگان

چکیده

Epigenetics is the study of phenotypic variations that do not alter DNA sequences. Cancer epigenetics has grown rapidly over past few years as epigenetic alterations exist in all human cancers. One these methylation; an process regulates gene expression and often occurs at tumor suppressor loci cancer. Therefore, studying this methylation may shed light on different functions cannot otherwise be interpreted using changes occur Currently, microarray technologies; such Illumina Infinium BeadChip assays; are used to extremely large number varying loci. At each site, a beta value (β) reflect intensity. clustering data from various types cancers lead discovery partitions can help objectively classify well identify relevant without user bias. This proposed Nested Big Data Clustering Genetic Algorithm (NBDC-GA); novel evolutionary metaheuristic technique perform cluster-based feature selection based sites. The efficacy NBDC-GA was tested real-world sets retrieved Genome Atlas (TCGA); cancer genomics program created by National Institute (NCI) Human Research Institute. performance then compared with recently developed Immuno-Genetic (IGA) same sets. outperformed IGA terms convergence performance. Furthermore, produced more robust configuration while simultaneously decreasing dimensionality features maximum 67% 94.5% for individual type collective cancer, respectively. also able two chromosomes highly contrasting methylations activities were previously linked

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature selection for DNA methylation based cancer classification

Molecular portraits, such as mRNA expression or DNA methylation patterns, have been shown to be strongly correlated with phenotypical parameters. These molecular patterns can be revealed routinely on a genomic scale. However, class prediction based on these patterns is an under-determined problem, due to the extreme high dimensionality of the data compared to the usually small number of availab...

متن کامل

Cluster-based pattern discrimination: A novel technique for feature selection

The study of feature selection methods has become an area of intensive research in pattern recognition. In this paper, a new feature selection approach, called cluster-based pattern discrimination (CPD), is introduced. Classes are independently partitioned into clusters to group together similar patterns: a different subspace is defined for each cluster by determining an optimal subset of featu...

متن کامل

Feature Selection and Extraction Framework for DNA Methylation in Cancer

Feature selection methods for cancer classification are aimed to overcome the high dimensionality of the biomedical data which is a challenging task. Most of the feature selection methods based on DNA methylation are time consuming during testing phase to identify the best pertinent features subset that are relevant to accurate prediction. However, the hybridization between feature selection an...

متن کامل

A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum

Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...

متن کامل

a new approach to credibility premium for zero-inflated poisson models for panel data

هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers, materials & continua

سال: 2023

ISSN: ['1546-2218', '1546-2226']

DOI: https://doi.org/10.32604/cmc.2023.033632